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@ The transmitter (1) in a digital transmission sys- 
tem derives from a wideband digital signal Sbb hav- 
ing a sample frequency a second digital signal 
appearing on its output (7), which signal comprises 
successive frames, each frame being made up of 
information packets (IP) each having a length of N 
bits. The number of information packets (B) in a 
frame is determined as follows: 
If P in the formula 

BR Hg 

P = X — 

N Fg 

r- 

^ is an integer, where 

(vj BR is the bit rate of the second digital signal, 

rsHs is the number of samples of the wide-band digital 

^signal, 

^ Sbb in which the information, after having been con- 
Overted into the second digital signal, is present in 
'■"one frame of the second digital signal, 
©the number of information packets B in a frame is 
£^ equal to P. If P is not an integer, B is equal to P' for 
Ilia number of frames, p' being the ne)ct lower integer 
following P, whereas B for the other frame is equal 
to P' + I, in such a way that the average frame rate 



of the second digital signal is substantially equal to 
F^/n,. 

A frame comprises a first frame portion (FD1), a 
second frame portion (FD2), and a third frame por- 
tion (FD3). The first frame portion contains synch- 
ronising information and system information. The 
second frame portion contains allocation information, 
and the third frame portion contains samples of and, 
if applicable, scale-factor Information for the second 
digital siignal. 
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Digital transmission system, transmitter and receiver for use in tiie transmission system, and record 
carrier obtained by means of the transmitter in the form of a recording device. 



The invention relates to a digital transmission 
system comprising a transmitter and a receiver, for 
transmitting a wide-band digital signal of a specific 
sample frequency Fg, for example a digital audio 
signal, via a transmission medium, and for receiv- 
ing said signal, the transmitter having an input 
terminal for receiving the wide-band digital signal, 
which input terminal is coupled to an input of a 
signal source which forms part of the transmitter 
and which is constructed to generate a second 
digital signal and supply said signal to an output, 
which second digital signal comprises consecutive 
frames, each frame comprising a plurality of in- 
formation packets, each Information packet com- 
prising N bits, N being larger than 1 , the receiver 
comprising a decoder having an input for receiving 
the second digital signal, which decoder has an 
output coupled to an output terminal to supply the 
wide-band digital signal. The invention also relates 
to a transmitter and a receiver for use in the 
transmission system, to a transmitter in the form of 
a device for recording the second digital signal in a 
track on a record carrier, to a record carrier ob- 
tained by means of the transmitter, and to a re- 
ceiver in the fonm of a device for reading the 
second digital signal from the track on the record 
can-ier. A transmission system of the type defined 
in the opening sentence is known from the article 
"The Critical Band Coder - Digital Encoding of 
Speech signals based on the Percentual require- 
ments of the Auditory System" by M.E. Krasner in 
Proc. IEEE IGASSP 80, Vol.l, pp.327-331, April 9- 
11, 19B0. This article relates to a transmission 
system In which the transmitter employs a subband 
coding system and the receiver employs a cor- 
responding subband decoding system, but the in- 
vention is not limited to such a coding system, as 
will become apparent hereinafter. 

In the system known from said publication the 
speech signal band is divided into a plurality of 
subbands whose bandwidth approximately corre- 
sponds to the bandwidths of the critical bands of 
the human ear in the respective frequency ranges 
(cf. Fig. 2 in the article of Krasner). This division 
has been selected because on the ground of 
psycho-acoustic experiments It is foreseeable that 
the quantisation noise in such a subband will be 
masked to an optimum extent by the signals in this 
subband if in the quantisation allowance is made 
for the noise-masking curve of the human ear (this 
curve gives the threshold value for noise masking 
in a critical band by a single tone in the centre of 
the critical band, cf. Fig.3 in the article by Krasner). 

In the case of a high-quality digital music sig- 



nal, which in conformity with the Compact Disc 
Standard is represented by 1 6 bits per signal sam- 
ple in the case of a sample frequency of 1/T = 
44.1 kHz, it is found that with a suitably selected 

5 bandwidth and a suitably selected quantisation for 
the respective subbands the use of this known 
subband-coding system yields quantised output 
signals of the coder which can be represented by 
an average number of approximately 2.5 bits per 

JO signal sample, the quality of the replica of the 
music signal not differing perceptibly from that of 
the original music signal in substantially all pas- 
sages of substantially all kinds of music signals. 
The subbands need not necessarily correspond 

75 to the bandwidths of the critical bands of the hu- 
man ear. Alternatively, the subbands may have 
other bandwidths, for example they may all have 
the same bandwidth, provided that allowance is 
made for this in determining the masking threshold. 

20 It is an object of the invention to provide a 

number of steps for the transmission system, in 
particular a very specific choice for the format with 
which the digital wide-band signal, after conversion 
into the second digital signal, can be transmitted 

25 via the transmission medium, in such away that a 
flexible and highly versatile transmission system is 
obtained. This is to be understood to mean that the 
transmitter should be capable of converting wide- 
band digital signals of different formats (which tor- 
so mats differ inter alia in respect of the sample 
frequency Fs of the wide-band digital signal, which 
may have different values such as 32 kHz, 44.1 
kHz and 48 kHz, as laid down in the digital audio 
interface standard of the AES and the EBU) into 

35 the second digital signal. Similarly, the receiver 
should be capable of deriving a wide-band signal of 
the correct format from said second digital signal. 
To this end the transmission system in accordance 
with the invention is characterized in that if P in the 

40 formula 

BR ng 
P = — X ~ 
4S N Fg 

is an integer, where 

BR is the bit rate of the second digital signal, and 
55 ns is the number of samples of the wideband digital 
signal whose corresponding information, which be- 
longs to the second digital signal, is included in 
one frame of the second digital signal, the number 
of information packets B in one frame is P, and in 
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that, if P is not an integer, the number of informa- 
tion pacl<ets in a number of the frames is p', P' 
being the next lower integer following P, and the 
number of information pacl<ets in the other frames 
is equal to P' + 1 so as to exactly comply with the 
requirement that the average frame rate of the 
second digital signal should be substantially equal 
to FS/ns and that a frame should comprise at least 
a first frame portion including the synchronising 
Information. The purpose of dividing the frames 
into B information packets is that for a wide-band - 
digital signal of an arbitrary sample frequency Fs 
the average frame rate of the second digital signal 
transmitted by the transmitter is now such that the 
duration of a frame in the second digital signal 
corresponds to the duration occupied by ns sam- 
ples of the wide-band signal. Moreover, this en- 
ables the synchronisation to be maintained on an 
information-packet basis, which is simpler and 
more reliable than maintaining the synchronisation 
on a bit basis. Thus, in those cases where P is not 
an integer, the transmitter is capable, at instants at 
which this possible and also necessary, to provide 
a frame with P' + 1 Instead of P' information blocks, 
so that the average frame rate of the second digital 
signal can be maintained equal to Fsins- Since In 
this case the spacing between the synchronising 
information (synchronising signals or synchronising 
words) included in the first frame portion of suc- 
ceeding frames is also an integral multiple of the 
length of an information packet it remains possible 
to maintain the synchronisation on an Information 
packet basis. Preferably, the first frame portion 
further contains information related to the number 
of information packets in a frame. In a frame com- 
prising B information packets this information may 
be equal to the value B. This means that this 
information corresponds to P' for frames compris- 
ing p' information packets and to P + 1 for frames 
comprising P' + 1 information packets. Another pos- 
sibility is that this information corresponds to P for 
all frames, reqardless of whether a frame com- 
prises P' or P' + 1 information packets. The addi- 
tionally inserted (P +1)th infonnfiation packet may 
comprise for example merely "zeros". In that case 
this infomnation packet does not contain any useful 
information. Of course, the additional information 
packet may also be filled with useful information. 
The first frame portion may further comprise sys- 
tem information. This may include the sample fre- 
quency Fs of the wide-band digital signal applied to 
the transmitter, copy-protection codes, the type of 
wide-band digital signal applied to the transmitter, 
such as a stereo-audio signal or a mono-audio 
signal, or a digital signal comprising two substan- 
tially independent audio signals. However, other 
system information is also possible, as will become 
apparent hereinafter. Including the system informa- 



tion makes it possible for the receiver to be also 
flexible and enables the received second digital 
signal to be correctly reconverted into the wide- 
band digital signal. The second and the third frame 

5 portions of a frame contain signal information. The 
transmitter may comprise a coder comprising 
signal-splitting means responsive to the wide-band 
digital signal to generate a second digital signal in 
the form of a number of M subsignals, M being 

10 larger than 1 , and comprising means for quantising 
the respective sub signals. For this purpose an 
arbitrary transform coding, such as the fast Fourier 
transfomn (FFT) may be used. In that case the 
transmission system is characterized in that the 

75 second frame portion of a frame contains allocation 
information which, for at least a number of sub 
signals, indicates the number of bits representing 
the samples of the quantised subsignals derived 
from said subsignals, and in that the third frame 

20 portion contains the samples of at least said quan- 
tised subsignals (if present). At the receiving end it 
is then necessary to apply an inverse transfonn 
coding, for example an inverse Fourier transform 
(IFFT), to recover the wide-band digital signal. The 

25 transmission system, in which the signal-splitting 
means take the form of analysis-filter means re- 
sponsive to the wide-band digital signal to generate 
a number of M subband signals, which analysis- 
filter means divide the signal band of the wide- 

30 band digital signal, using a sample-frequency re- 
duction, into successive subbands having band 
numbers m increasing with the frequency, and in 
which the quantisation means are adapted to quan- 
tise the respective subband signals block by block, 

35 is a system employing subband coding as de- 
scribed above. Such a transmission system is char- 
acterized further in that for at least a number of the 
subband signals the allocation information in the 
second frame portion of a frame specifies the num- 

40 ber of bits representing the samples of the quan- 
tised subband signals derived from said subband 
signals and in that the third frame portion contains 
the samples of at least said quantised subband 
signals (If present. This means in fact that the 

45 allocation infomnation is inserted in a frame before 
the samples. This allocation information is needed 
to enable the continuous serial bit stream of the 
samples in the third frame portion to be subdivided 
into the various individual samples of the correct 

50 number of bits at the receiving end. The allocation 
information may require that all samples are repre- 
sented by a fixed number of bits per subband per 
frame, this is referred to as a transmitter based on 
fixed or static bit allocation. The allocation informa- 

55 tion may also imply that a number of bits variable 
in time Is used for the samples in a subband. This 
Is referred to as a transmitter based on the system 
of adaptive or dynamic bit allocation. Fixed and 
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adaptive bit allocation are described inter alia in the 
publication "Low bit-rate coding of higlPquality 
audio signals. An introduction to the MASCAM sys- 
tem" by G. Theile et al, EBU Technical Review, 
No. 230 (August 1988). Inserting the allocation in- 
formation in a frame before the samples in a frame 
has the advantage that at the receiving end a 
simpler decoding becomes possible, which can be 
carried out in real time and which produces only a 
slight signal delay. As a result of this sequence it is 
no longer necessary to first store all the information 
in the third frame portion In a memory in the 
receiver. Upon arrival of the second digital signal 
the allocation information is stored in a memory in 
the receiver. Information content of the allocation 
information is much smaller than the Information 
content of the samples in the third frame portion, 
so that a substantially smaller store capacity is 
needed than in the case that all the samples would 
have to be stored in the receiver. Immediately 
upon arrival of the serial data stream of the sam- 
ples in the third frame portion this data stream can 
be divided into the various samples having the 
number of bits specified by the allocation informa- 
tion, so that no previous storage of the signal 
information is necessary. The allocation information 
for all the subbands can be included In a frame. 
However, this is not necessary, as will become 
apparent hereinafter. 

The transmission system may be characterized 
further in that in addition the third frame portion 
includes information related to scale factors, a 
scale factor being associated with at least one of 
the quantised subband signals contained in the 
third frame portion, and In that the scale factor 
Information is included In the third frame portion 
before the quantised subband signals. The samples 
can be coded in the transmitter without being nor- 
malised, Le. without the amplitudes of a block of 
samples in a subband having been divided by the 
amplitude of the sample having the largest am- 
plitude in this block. In that case no scale factors 
have to be transmitted. If the samples are nor- 
malised during coding scale factor information has 
to be transmitted to provide a measure of said 
largest amplitude. If In this case the scale factor 
information is also Inserted In the third frame por- 
tion before the samples it is possible that during 
reception to the scale factors to be derived from 
said scale information are first stored in a memory 
and the samples are multiplied immediately upon 
arrival, Le. without a time delay, by the inverse 
values of said scale factors. The scale factor in- 
fonnation may be constituted by the scale factors 
themselves. It is obvious that a scale factor as 
inserted in the third frame portion may also be the 
inverse of the amplitude of the largest sample in a 
block, so that in the receiver it is not necessary to 



determine the inverse value and consequently de- 
coding can be faster. Alternatively, the values of 
the scale factors may be encoded prior to insertion 
in the third frame portion as scale factor informa-. 

s tion and subsequent transmission. Moreover, it is 
evident that if after quantisation in the transmitter 
the subband signal in a subband is zero, which 
obviously will be apparent from the allocation in- 
formation for the subband, no scale factor informa- 

10 tion for this subband has to be transmitted. The 
transmission system, in which the receiver com- 
prises a decoder comprising synthesis-filter means 
responsive to the respective quantised subband 
signals to construct a replica of the wide-band 

75 digital signal, which synthesis-filter means combine 
the subbands applying sample-frequency increase 
to form the signal band of the wide-band digital 
signal, may be characterized in that the samples of 
the subband signals (if present) are inserted in the 

20 third frame portion in a sequence corresponding to 
the sequence in which said samples are applied to 
the synthesis-filter means upon reception in the 
receiver. Inserting the samples in the third frame 
portion in the same sequence as that in which they 

25 are applied to the synthesis-filter means In the 
receiver also results in fast decoding, which again 
does not require additional storage of the samples 
in the receiver before they can be further pro- 
cessed. Consequently, the storage capacity re- 

30 quired in the receiver can be limited substantially 
to the storage capacity needed for the storage of 
the system information, the allocation information 
and, if applicable, the scale factor Information. 
Moreover, a limited signal delay is produced, which 

35 is mainly the result of the signal processing per- 
formed upon the samples. The allocation informa- 
tion for the various quantised subband signals is 
suitably Inserted in the second frame portion in the 
same sequence as that in which the samples of the 

40 subband signals are Included in the third frame 
portion. The same applies to the sequence of the 
scale factors. If desired, the frames may also be 
divided into four portions, the first, the second and 
the third frame portion being as described herein- 

45 before. The last (fourth) frame portion in the frame 
may then contain error-detection and/or error-cor- 
rection information. Upon reception of this informa- 
tion in the receiver it is possible to apply a correc- 
tion for errors produced in the second digital signal 

50 during transmission. As already stated, the wide- 
band digital signal may be a monophonic signal. 
Alternatively, the wide-band digital signal may be a 
stereo audio signal made up of a first (left) and a 
second (right) channel component. If the transmis- 

55 sion system is based on a subband-coding system 
the transmitter will supply subband signals each 
comprising a first and a second subband-signal 
component, which after quantisation In the quan- 
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tisation means are converted to fornn first and sec- 
ond quantised subband signai components. In tills 
case ttie frames siiould also include allocation in- 
formation and scale-factor information Of the sam- 
ples have been scaled in the transmitter). The 
sequence is also important here. Such a transmis- 
sion system is therefore defined in the appended 
Claims 11 to 15. It is obvious that the system can 
be extended to handle a wide-band digital signal 
comprising more than two signal components. 

The inventive steps may be applied to digital 
transmission systems, for example systems for the 
transmission of digital audio signals (digital audio 
broadcast) via the ether. However, other uses are 
also conceivable. An example of this is a transmis- 
sion via optical or magnetic media. Optical-media 
transmissions may be, for example, transmissions 
via glass fibres or by means of optical discs or 
tapes. Magnetic-media transmissions are possible, 
for example, by means of a magnetic disc or a 
magnetic tape. The second digital signal Is then 
stored in the format as proposed by the invention 
in one or more tracks of a record carrier, such as 
an optical or magnetic disc or a magnetic tape. 
The versatility and flexibility of the transmission 
system thus resides In the special format with 
which the information in the form of the second 
digital signal is transmitted, for example via a 
record carrier. This is combined with the special 
construction of the transmitter which is capable of 
generating this special format for various types of 
input signals. The transmitter generates the system 
information required for every type of signal and 
inserts this information in the data stream to be 
transmitted. At the receiving end this Is achieved 
by means of a specific receiver, which extracts 
said system information from the data stream and 
employs it for a correct decoding. 

The information packets are then constitute a 
kind of fictitious units, which are used to define the 
length of a frame. This means that they need not 
be explicitly discernible in the information stream 
of the second digital signal. Moreover, the relation- 
ship of the information packets with the existing 
digital audio interface standard is as defined in the 
lEC standard no. 958. This standard as normally 
applied to consumer products defines frames con- 
taining one sample of both the left-hand and the 
right-hand channel of a stereo signal. These sam- 
ples are represented by means of 16-bit two's 
complement words. If N = 32 is selected, one 
frame of this digital audio interface standard can 
transmit exactly one information packet of the sec- 
ond digital signal. In the digital audio interface 
standard the frame rate Is equal to the sample rate. 
For the present purpose the frame rate should be 
selected to be equal to BR/N. This enables the 
present ICs employed in standard digital audio 



interface equipment to be used. 

Embodiments of the invention will now be de- 
scribed in more detail, by way of example, with 
reference to the Figure. In the Figures 
5 Fig. 1 shows the second digital signal gen- 

erated by the transmitter and made up of frames, 
each frame being composed of information pack- 
ets, 

Fig. 2 gives the structure of a frame, 
10 Hg. 3 shows the structure of the first frame 

portion of a frame. 

Fig. 4 gives an example of the transmission 

system, 

Fig. 5 is a table specifying the number of 
75 information packets B in a frame for specific values 
of the bit rate BR and the sample frequency Fs, 

Fig. 6 gives the number of frames in a 
padding sequence and a number of frames thereof 
comprising an additional information packet 
20 (dummy slot) for a number of values of the bit rate 
BR, 

Fig. 7 represents the system information in- 
cluded in the first frame portion of a frame, 

Flg.8 illustrates the distribution of tlie digital 
26 Information about the various (two) channels for a 
number of modes, 

Fig. 9 illustrates the significance of the al- 
location information as Inserted in the second 
frame portion, 

30 Figs. 10 and 11 Illustrate the sequence in 

which the allocation information is stored in the 
second frame portion for two formats, format A and 
format B respectively. 

Fig. 12 shows an example of a receiver, 
35 Fig. 13 shows a transmitter in the form of a 

device for recording the second digital signal on a 
magnetic record carrier, 

Fig.14 shows the receiver in the form of a 
device for reproducing the second digital signal 
40 from a magnetic record carrier. 

Figs. 15a to 15d show some further possibili- 
ties of including the scale factors and samples In 
the third frame portion of a frame, 

Rg. 16 shows a further modification of the 
45 transmitter. 

Fig. 17 shows another structure of the first 
frame portion of a frame. 

Fig. 18 shows the system information in- 
cluded in the first frame portion illustrated in Fig. 
50 17, 

Figs. 19 and 20 show in more detail the 
information in the first frame portion illustrated in 
Fig. 17, 

Figs. 21 and 22 illustrate the sequence in 
55 which the allocation information is accommodated 
In the second frame portion associated with the 
first frame portion of Fig. 17, 

Fig. 23 gives the structure of a frame filled 
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with an additional signal, 

Fig, 24 illustrates how the scale factors are 

derived, 

Rg. 25 illustrates the quantisation of the 
scaled samples to form q-bit digital representa- 
tions, and 

Fig. 26 Illustrates the dequantisation of the 
q-bit digital representations. 

Fig. 1 shows diagrammatically the second digi- 
tal signal as generated by the transmitter and 
transmitted via the transmission medium. The sec- 
ond digital signal tal<es the form of the serial digital 
data stream. The second digital signal comprises 
frames, two such frames, i.e. the frame j and the 
frame j + being given in Fig.la. The frames, such 
as the frame j, comprise a plurality of Information 
packets IP1, IP2, IPS, see Fig.lb. Each informa- 
tion packet,such as IPS, comprises N bits bo, bi, 
bs, bti-u see Fig. 1c. The number of Information 
packets in a frame depends upon 

(a) the bit rate BR with which the second 
digital signal is transmitted via the transmission 
medium, 

(b) the number of bits N In an information 
packet, N being larger than 1 , 

(c) Fs. being the sample frequency of the 
wide-band digital signal, and 

(d) the number of samples of the wide- 
band digital signal, the information which corre- 
sponds thereto and which after conversion In the 
transmitter belongs to the second digital signal 
being included in one frame in the following man- 
ner. 

The parameter P is computed in conformity 
with the following formula 

BR Hg 
P = — X — 
N Fg 

If this computation yields an integer for P the- 
number of information packets B in a frame will be 
equal to P. If the computation does not result in an 
integer some frames will comprise P' information 
packets and the other frames will comprise P' + 1 
information packets, p' is the next lower integer 
following P. The number of frames comprising P' 
and P +1 information packets is obviously select- 
ed In such away that the average frame rate is 
equal to F„/„s. Hereinafter it is assumed that N=32 
and ns=384. The table in Fig.5 gives the number 
of information packets (slots) in one frame for these 
values for N and ns and for four values of the bit 
rate BR and three values for the sample frequency 
Fs. It is evident that for a sample frequency Fg 
equal to 44.1 kHz the parameter P is not an integer 



In all cases and that consequently a number of 
frames comprise 34 information packets and the 
other frames comprise 35 information packets 
(when BR is 128 kbit/s). This is also illustrated in 

5 Fig. 2. Fig. 2 shows one frame. The frame com- 
prises P' information packets IPl, IP2 IP p'. 

Sometimes a frame comprises P' + l information 
packets. This is achieved by assigning an addi- 
tional Information packet (dummy slot) to the 

70 frames of P' information packets. The second col- 
umn of the table of Flg.6 gives the number of 
frames in the padding sequence for a sample fre- 
quency of 44.1 kHz and the aforementioned four bit 
rates. The third column specifies those frames of 

75 said number of frames in the sequence which 
comprise P + 1 information packets. By subtracting 
the numbers in the second and the third column 
from each other this yields the number of frames in 
the sequence comprising p' information packets. 

20 The (P +1)th information packet then need not 
contain any information. The (P' + 1)th information 
packet may then comprise for example only ze- 
roes. It is obvious that the bit rate BR is not 
necessarily limited to the four values as given in 

25 the tables of Figs. 5 and 6. Other (for example 
intermediate) values are also possible. Fig. 2 shows 
that a frame comprises three frame portions FD1, 
FD2 and FD3 in this order. The first frame portion 
FD1 contains synchronising information and sys- 

30 tem information. The second frame portion FD2 
contains allocation information. The third frame por- 
tion FD3 contains samples and, when applicable, 
scale factors of the second digital signal. For a 
further explanation it is necessary to first describe 

35 the operation of the transmitter in the transmission 
system in accordance with the invention. 

Fig.4 shows diagrammatically the transmission 
system comprising a transmitter 1 having an input 
terminal 2 for receiving the wide-band digital signal 

40 Sbb. which may be for example a digital audio 
signal. In the case of an audio signal, this may be a 
mono signal or a stereo signal, in which case the 
digital signal comprises a first (left channel) and a 
second (right channel) signal component. It is as- 

45 sumed that the transmitter comprises a coder for 
subband coding of the wide-band digital signal and 
that the receiver consequently comprises a sub- 
band decoder for recovering the wide-band digital 
signal. The transmitter comprises analysis filter 

50 means 3 responsive to the digital wide-band signal 
Sbb to generate a plurality of M subband signals 
SsBi to SsBM. which analysis filter means divide the 
signal band of the wide-band signal Sbb with 
sample-frequency reduction into successive sub- 

55 bands having band numbers M (1 g m S M), which 
Increase with the frequency. All these subbands 
may have the same bandwidth but, alternatively, 
the subbands may have different bandwidths. In 
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that case the subbands may correspond, for exam- 
ple, to the bandwidths of the critical bands of the 
human ear. The transmitter further comprises 
means for the block-by-block quantisation of the 
respective subband signals. These quantisation 
means are shown in the block bearing the refer- 
ence numeral 9 in Rg. 4. 

Such a subband coder is known per se and is 
described inter alia in the aforementioned publica- 
tions by KrasneFand by Theile et al. Reference is 
also made to the published European Patent Ap- 
plication 289,080 (PHN.12.108). 

For a further description of the operation of the 
subband coder reference is made to said publica- 
tions. These publications are therefore incorporated 
herewith by reference. Such a subband coder en- 
ables a significant data reduction to be achieved, 
for example a reduction from 16 bits per sample 
for the wide-band digital signal Sbb to for example 
4 bits per sample in the signal which is transmitted 
to the receiver 5 via the transmission medium 4, 
see Fig. 4. Above Hs is assumed to be 384. This 
means that there are blocks of 384 samples of the 
wide-band digital signal, each sample having a 
length of 16 bits. Now it is also assumed that M = 
32. Consequently, the wide-band digital signal is 
split into 32 subband signals in the analysis filter 
means 3. Now 32 (blocks of) subband signals ap- 
pear on the 32 outputs of the analysis filter means, 
each block comprising 12 samples (the subbands 
have equal width) and each sample having a length 
of 16 bits. This means that on the outputs of the 
filter means 3 the information content is still equal 
to the information content of the block of 384 
samples of the signal Sbb on the input 2. The 
means 9 now provide data reduction in that, using 
the knowledge about masking, the samples in the 
32 blocks of 12 samples, each block for one sub- 
band, are quantised more roughly and can thus be 
represented by a smaller number of bits, in the 
case of a static bit allocation all the samples per 
subband per frame are expressed in a fixed num- 
ber of bits. This number can be different for two or 
more subbands but It can also be equal for the 
subbands, for example equal to 4 bits. In the case 
of dynamic bit allocation the number of bits se- 
lected for every subband may differ viewed in time, 
so that sometimes an even larger data reduction or 
a higher quality with the same bit rate can be 
achieved. 

The subband signals quantised in the block 9 
are applied to a generator unit 6. Starting from the 
quantised subband signals this unit 6 generates the 
second digital signal as illustrated in Figs. 1 and 2. 
This second digital signal, as stated hereinbefore, 
can be transmitted directly via the medium. How- 
ever, preferably this second digital signal is first 
adapted to be transmitted via the transmission me- 



dium 4 in a signal converter (not shown). Such a 
signal converter comprises, for example, an 8-to-10 
converter. Such an 8-to-10 converter is described 
in, for example, the Applicant's European Patent 

5 Application 150,082 (PHN 11.117). This converter 
converts 8-blt data words into 1 0-bit data words. 
Moreover, such a signal converter enables an Inter- 
leaving process to be applied. The purpose of all 
this is to enable an error correction to be per- 

70 formed on the information to be received at the 
receiving side. 

It is obvious that the signal received from the 
transmission medium 4 by the receiver 5 should 
then be de-interleaved and subjected to a 10-to-8 

75 conversion. 

The composition and content of the frames will 
now be explained in more detail. The first frame 
portion FD1 in Fig. 2 is shown in greater detail in 
Fig. 3. Fig.3 clearly shows that the first frame 

20 portion now comprises exactly 32 bits and is there- 
fore exactly equal to one information packet, name- 
ly the first information packet 1P1 of the frame. The 
first 16 bits of the information packet form the 
synchronising signal (or synchronising word). The 

26 synchronising signal may comprise for example 
only "ones". The bits 16 to 31 represent ttie sys- 
tem Information. The bits 16 to 23 represent the 
numtjer of information packets In a frame. This 
number consequently corresponds to p', both for 

30 the frame comprising P information packets and 
for frames comprising the additional information 
packet IP P' + l. P' can be 254 (1111 1110 in bit 
notation) at the most in order to avoid resemblance 
to the synchronising signal. The bits 24 to 31 

35 provide frame format information. Fig. 7 gives an 
example of the arrangement and significance of 
this information. Bit 24 indicates the type of frame, 
in the case of format A the second frame portion 
has another length (a different number of informa- 

40 Won packets) than In the case of format B. As will 
become apparent hereinafter, the second frame 
portion FD2 in the A format comprises 8 infonna- 
tion packets, namely the Information packets IP2 to 
IP9 Inclusive and In the B format it comprises 4 

45 information packets, namely the information pack- 
ets IP2 to IPS inclusive. The bits 25 and 26 indicate 
whether copying of the information is allowed. The 
bits 27 to 31 indicate the function mode. This 
means: 

50 a) the channel mode, which indicates the 

type of wide-band signal (as stated hereinbefore 
this may be a stereo audio signal, a mono audio 
signal, or an audio signal comprising two different 
signal components for example representing the 

55 same text but in two different languages). Hg. 8 
represents the channel mode. It Illustrates how the 
signal components are divided between the two 
channels (channel 1 and channel II) in the afore- 
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mentioned cases. 

b) the sample frequency Fs of the wide-band 

signal. 

c) the emphasis which may be applied to the 
wide-band digital signal in the transmitter. The val- 
ues 50 and 15 us are the time constants of the 
emphasis and CCITT J. The value 17 indicates a 
specific emphasis standard as defined by the 
CCITT (Comite Consultative Internationale de 
Telegraphie et Telephonie). 

The content of the frame portion FD2 in Flg.2 
will be described in more detail with reference to 
Figs. 9, 10 and 11. In the A format the second 
frame portion contains eight information packets. 
This is because it is assumed that the wide-band 
digital signal Sbb is converted into 32 subband 
signals (for every signal portion of the digital signal 
Sbb)- An allocation word having a length of four bits 
is assigned to every subband. This yields a total of 
64 allocation words having a length of 4 bits each, 
which can be accommodated exactly in eight in- 
formation packets. In the B format the second 
frame portion accommodates the allocation infor- 
mation for only half the number of subbands, so 
that now the second frame portion comprises only 
4 information packets. Fig. 9 Illustrates the signifi- 
cance of the four-bit allocation words AW. An al- 
location word associated with a specific subband 
specifies the number of bits by which the samples 
of the subband signal in the relevant subband are 
represented after quantisation in the unit 9. For 
example: the allocation word AW which is 0100 
indicates that the samples are represented by 5-bit 
words. Moreover, it follows from Fig. 9 that the 
allocation word 0000 indicates that no samples 
have been generated in the relevant subband. This 
may happen, for example, if the subband signal in 
an adjacent subband has such a large amplitude 
that this signal fully masks the subband signal in 
the relevant subband. Moreover, the allocation 
word 1111 is not used because It bears much 
resemblance to the sync word in the first informa- 
tion packet 1P1. Fig.10 indicates the sequence, in 
the case that the frame mode is A, in which the 
allocation words AW, j,m associated with the two 
channels j, were j = 1 or II, and the 32 subbands of 
the sequence number m, m ranging from 1 to 32, 
are arranged in the second frame portion. The 
allocation word AWl.l belonging to the first sub- 
band signal component of the first and lowest sub- 
band (channel I, subband 1) is inserted first. After 
this the allocation word AWIi,1 belonging to the 
second subband-signal component of the first and 
lowest subband (channel II, subband 1) is inserted 
In the second frame portion FD2. Subsequently, 
the allocation word AWI,2 belonging to the first 
subband-signal component of the second and low- 
est but one subband (channel I, subband 2) Is 



inserted in the frame portion FD2. This is followed 
by the allocation word AW 11,2 belonging to the 
second subband-signal component of the second 
subband (channel II, subband 2). This continues 

5 until the allocation word AW 11,4 belonging to the 
second subband-signal component of the fourth 
subband (channel II, subband 4) is inserted in the 
second frame portion FD2. The second information 
packet IP2 (slot 2) of the frame, which is the first 

10 Information packet In the frame portion FD2 of the 
frame, is then filled exactly. Subsequently, the in- 
fomiation packet IPS (slot 3) is filled with AW 1,5; 
AW 11,5; ... AW 11,8. This continues in the sequence 
as Illustrated in Fig. 10. Fig. 10 merely gives the 

75 indices j-m of the inserted allocation word AW, j,m. 
Fig. 11 indicates the sequence for the allocation 
words in the case of a B-format frame. In this case 
only allocation words of the subbands 1 to 16 are 
inserted. The sequence, as is illustrated in Fig. 10, 

20 corresponds to the sequence in which the separate 
samples belonging to a channel j and a subband m 
are applied to the synthesis filter means upon 
reception in the receiver. This will be explained in 
greater detail hereinafter. The serial data stream 

25 contains for example only frames in conformity with 
the A format. In the receiver the allocation informa- 
tion in each frame Is then employed for correctly 
deriving the samples from the information In the 
third frame portion of said frame. However, the 

30 serial data stream may also comprise, more or less 
alternately, both frames in conformity with the A 
format and frames in conformity with the B format. 
However, the frames in conformity with both for- 
mats may contain samples for all channels and all 

35 subbands in the third frame portion. A frame in 
conformity with the B format then lacks in fact the 
allocation information required to derive the sam- 
ples for the channels I or II of the subbands 17 to 
32 from the third frame portion of a B format frame. 

40 The receiver comprises a memory In which the 
allocation information included in the second frame 
portion of an A format frame can be stored. If the 
next frame is a B format frame only the allocation 
information for the subbands 1 to 16 and the chan- 
ts nels I and II in the memory is replaced by the 
allocation information included in the second frame 
portion of the B format frame, and for deriving the 
samples for the subbands 17 to 32 from the third 
frame portion of the B format frame use is made of 

50 the allocation information for these subbands de- 
rived from the preceding A format frame and still 
present in the memory. The reason for the al- 
ternate use of A format frames and B format 
frames is that for some subbands the allocation 

55 information in the present case the allocation in- 
formation for the higher subbands 17 to 32, does 
not change rapidly. Since during quantisation 
knows the allocation information for the various 
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subbands is available in the transmitter, this trans- 
mitter can decide to generate a B format frame 
instead of an A format frame if the allocation in- 
formation for the subbands 17 to 32 inclusive does 
not change (significantly). Moreover, this illustrates 
that now additional space becomes available for 
the inclusion of samples in the third frame portion 
FD3. For a specific value of P' the third frame 
portion of a B format frame is four information 
paci<ets longer than the third frame portion of an A 
format frame. Consequently, this enables the num- 
ber of bits by which the samples in the lower 
subbands 1 to 16 are represented to be increased, 
so that for these subbands a higher transmission 
accuracy can be achieved. IVloreover, if it is re- 
quired to quantise the lov/er subbands more ac- 
curately the transmitter can automatically opt for 
the generation of B format frames. This may then 
be at the expense of the accuracy with which the 
higher subbands are quantised. 

The third frame portion FD3 In Fig. 2 contains 
the samples of the quantised subband-signal com- 
ponents for the two channels. If the allocation word 
0000 Is not present in the frame portion FD2 for 
none of the subband channels this means that in 
the present example twelve samples are inserted in 
the third frame portion FD3 for each of the 32 
subbands and 2 channels. This means that there 
are 768 samples in total. In the transmitter the 
samples may be multiplied by a scale factor prior 
to their quantisation. For each of the subbands and 
channels the amplitudes of the twelve samples are 
divided by the amplitude of that sample of the 
twelve samples which has the largest amplitude. In 
that case a scale factor should be transmitted for 
every subband and every channel In order to en- 
able the inverse operation to be performed upon 
the samples at the receiving end. For this purpose 
the thlreJ frame portion then contains scale factors 
SF j,m, one for each of the quantised subband- 
signal components in the various subbands. In the 
present example, scale factors are represented by 
6-bit numbers, the most significant bit first, the 
values ranging from 000000 to 111110- The scale 
factors of the subbands to which these are al- 
located, i.e. whose allocation Information Is non- 
zero, are transmitted before the transmission of the 
samples begins. This means that the scale factors 
are accommodated in the leading part of the frame 
portion FD3 before the samples. This enables a 
rapid decoding in the receiver 5 to be achieved 
without the necessity of storing all the samples in 
the receiver, as will become apparent hereinafter. A 
scale factor SF j,m can thus represent the value by 
which the samples of the signal in the j-th channel 
of the m-th subband have been multiplied. Con- 
versely: the number one divided by said value may 
be stored as the scale factor so that at the receiv- 



ing end it is not necessary to divide the scale 
factors before the sample, are scaled up to correct 
values. 

For the frame format A the maximum number 
5 of scale factors is 64. If the allocation word AW j,m 
for a specific channel j and a specific subband m 
has the value 0000, which means that for this 
channel and this subband no samples are present 
In the frame portion FD3, it will not be necessary to 
10 include a scale factor for this channel and this 
subband. The number of scale factors is then 
smaller than 64. The sequence in which the scale 
factors SF j,m are inserted in the third frame por- 
tion FD3 is the same as that in which the allocation 
is words have been inserted in the second frame 
portion. The sequence is therefore as follows: 
SF 1,1; SF 11,1; SF 1,2; SF 11,2; SF 1,3; SF 11,3;.... SF 
1,32, SF 11,32. 

If it is not necessary to insert a scale factor the 

20 sequence will not be complete, the sequence may 
then be for example: 
....SF 1,4; SF 1,5; SF 11,5; SF 11,6;.... 
In this case the scale factors for the fourth subband 
of channel II and the sixth subband of channel I are 

25 not inserted. If the frame is a B format frame it may 
still be considered to insert scale factors in the 
third frame portion for all the subbands and all the 
channels. However, this is not necessarily so. In 
this case it would be possible to insert scale fac- 

30 tors in the third frame portion of the frame for the 
subbands 1 to 16 only, in the receiver this requires 
a memory in which all scale factors can be stored 
at the instant at which a previously arriving A 
format frame is received. Subsequently upon re- 

35 ception of the B format frame only the scale factors 
for the subbands 1 to 1 6 are replaced by the scale 
factors Included in the B format frame. The scale 
factors of the previously received A format frame 
for the subbands 17 to 32 are then used In order to 

40 restore the samples for these subbands Included in 
the third frame portion of the B format frame to the 
correct scale. 

The samples are inserted in the third frame 
portion FD3 in the same sequence as the allocation 

45 words and the scale factors, one sample for every 
subband of every channel in succession. This 
means: first all the first samples for the quantised 
subband signals for all the subbands of both chan- 
nels, then all the second samples, .... etc. The 

50 binary representation of the samples is arbitrary, 
the binary word comprising only "ones" preferably 
not being used again. 

The second digital signal generated by the 
transmitter 1 is subsequently applied to a transmis- 

ss sion medium 4 via the output 7, and by means of 
the transmission medium 4 this signal is trans- 
ferred to the receiver 5. The transmission via the 
transmission medium 4 may be a wireless trans- 
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mission, such as for example a radio transmission 
channel. However, other transmission media are 
also possible. In this respect an optical transmis- 
sion may be envisaged, for example via optical 
fibres or optical record carriers, such as~Compact- 
Disc-like media, or a transmission by means of 
magnetic record carriers utilising RDAT or SDAT- 
like recording and reproducing technologies, for 
which reference is made to the book "The art of 
digital audio" by J.Watkinson, Focal press, London 
1988. 

The receiver 5 comprises a decoder, which 
decodes the signal encoded in the coder 6 of the 
transmitter 1 and converts it into a replica of the 
wide-band digital signal supplied to the output 8. 

Fig. 12 shows a more detailed version of the 
receiver 5 in Fig. 4. The coded signal (the second 
digital signal) is applied to a unit 1 1 via the terminal 
1 0. The essential information in the incoming signal 
is contained in the scale factors and the samples. 
The remainder of the information in the second 
digital signal is merely required for a "correct 
bookkeeping", to allow a correct decoding. The 
decoding process is repeated for every incoming 
frame. The transmitter first derives the synchronis- 
ing and system information from the frames. The 
unit 19 each time detects the sync words situated 
in the first 16 bits of the first frame portion of every 
frame. Since the sync words of successive frames 
are each time spaced apart by an integral multiple 
of P or P +1 Information packets the sync words 
can be detected very accurately. Once the receiver 
Is in synchronism the sync word can be detected 
in the unit 19 in that in the unit 19 a time window 
having, for example, a length of one information 
packet Is opened after each time P information 
packets, so that only that part of the incoming 
information is applied to the sync word detector in 
the unit 19. If the sync word is not detected the 
time window remains open for the duration of an- 
other information packet because the preceding 
frame may be a frame comprising P' + 1 informa- 
tion packets. From these sync words a PLL In the 
unit 19 can derive a clock signal to control the 
central processing unit 18. It is evident from the 
above that the receiver should know how many 
information packets are contained in one frame. For 
this purpose the system information is applied to 
the switching means 15 via an input of the process- 
ing unit 18, which switching means are then in the 
position shown. The system information can now 
be stored in a memory 18a of the processing unit 
18. The information relating to the number of in- 
formation packets in a frame can be applied to the 
unit 19 via the control-signal line 20 to open the 
time window at the correct Instants for sync-word 
detection. When the system information is received 
the switch 15 is changed over to the lower position. 



The allocation information in the second frame por- 
tion of the frame can now be stored in the memory 
18b. If the allocation information in the incoming 
frame does not comprise an allocation word for all 

5 the subbands and channels this will have become 
apparent already from the detected system infor- 
mation. This may be for example the information 
indicating whether the frame is an A-format or a B- 
format frame. Thus, under the influence of the 

10 relevant Information contained in the system in- 
formation the processing unit 18 will store the 
received allocation words at the correct location In 
the allocation memory 18b. It is obvious that in the 
present example the allocation memory 18b com- 

75 prises 64 storage positions. If no scale factors are 
transmitted, the elements bearing the reference 
numerals 11, 12 and 17 may be dispensed with 
and the content of the third frame portion of a 
frame is applied to the synthesis filter means via 

20 the input 1 0, which is coupled to the input of said 
filter means via the connection 16. The sequence 
in which the samples are applied to the filter 
means 21 is the same as the se;quence in which 
the filter means 21 process the samples in order to 

25 reconstruct the wide-band signal. The allocation 
information stored in the memory 18b is required in 
order to divide the serial data stream of samples 
into individual samples in the filter means 21 , each 
sample having the correct number of bits. For this 

30 purpose the allocation information is applied to the 
filter means 21 via the line 22. The receiver further 
comprises a deemphasis unit 23 which subjects 
the reconstructed digital signal supplied by the 
filter 21 to deemphasis. For a correct deemphasis 

35 the relevant Infonnation in the bits 24 to 31 of the 
first frame portion should be applied from the 
memory 18a to the deemphasis unit 23 via the line 
24. 

If the third frame portion also contains the 

40 scale factors SF j,m the receiver will comprise the 
switch 11, the memory 12 and the multiplier 17. All 
the instant at which the third frame portion FD3 of 
a frame arrives the switch 11 is in the lower posi- 
tion under the influence of a control signal applied 

45 by the processing unit 18 via the line 13. The scale 
factors can now be applied to the memory 12. 
Under the influence of address signals applied to 
the memory 12 by the processing unit 18 via the 
line 14 the scale factors are stored at the correct 

50 locations in the memory 12. The memory 12 has 
64 locations for the storage of the 64 scale factors. 
Again, when a B-format frame Is received, the 
processing unit 18 applies such address signals to 
the memory 12 that only the scale factors for the 

55 subbands I to 16 are overwritten by the scale 
factors in the B-format frame. Subsequently, the 
switch 1 1 changes over to the shown (upper) posi- 
tion under the influence of the control signal ap- 



10 



19 



EP 0 402 973 A1 



20 



plied via ttie line 13, so that the samples are 
applied to the multiplier 17. Under the influence of 
the aliocation information, which is now applied to 
the multiplier 17 via the line 22, the multiplier first 
derives the individual samples of the correct bit 
length form the serial data stream applied via the 
line 16. Subsequently, the samples are multiplied 
so as to restore them to the correct values of the 
samples prior to scaling-down in the transmitter. If 
the scale factors stored in the memory 12 are the 
scale factors by which the samples have been 
scaled down In the transmitter these scale factors 
should first be inverted (one divided by the scale 
factor) and should then be applied to the multiplier 
17. Obviously, it is also possible to invert the scale 
factors upon reception before they are stored in the 
memory 12. If the scale factors in the frames are 
already equal to the value by which the samples 
should be scaled up during reception they can be 
stored directly in the memory 12 and they can be 
applied directly to the multiplier 17. It is evident 
that no memory is required to store all these sam- 
ples before the signal processing performed upon 
the samples contained in the frame begins. At the 
instant at which a sample arrives via the line 16 all 
the infonnation required for processing this sample 
is already available, so that processing can be 
carried out immediately. This entire process is ef- 
fected under the influence of control signals and 
clock signals applied to all the parts of the trans- 
mitter by the processing unit 18. By no means all 
the control signals are shown. This is not neces- 
sary because the operation of the receiver will be 
obvious to those skilled in the art. Under control of 
the processing unit 18 the multiplier 17 multiplies 
the samples by the appropriate multiplication fac- 
tors. The samples, which have now been restored 
to the correct amplitude, are applied to the re- 
construction filter 18 in which the subband signals 
are reconverted to form the wide-band digital sig- 
nal. A further description of the receiver is not 
necessary because such receivers are generally 
known, see for example the publication "Low bit 
rate coding of high-quality audio signals. An in- 
troduction to the MASCAM system" by G. Theile et 
al in EBU Technical Review, no. 230, August 1988. 
Moreover, it will be evident that if the system 
information is also transmitted the receiver can be 
highly flexible and can correctly decode the signals 
even in the case of second digital signals with 
different system information. 

Fig.13 shows diagrammatically yet another em- 
bodiment of the transmitter, which now takes the 
form of a recording device for recording the wide- 
band digital signal on the record carrier, in the 
present case a magnetic record carrier 25. The 
encoder 6 supplies the second digital signal to a 
recording device 27 comprising a write head 26 by 



means of which the signal is recorded in a track on 
the record carrier. It is then possible to record the 
second digital signal in a single track on the record 
carrier, for example by means of a helical-scan 

5 recorder, in which case the single track is then in 
fact divided into juxtaposed tracks which are in- 
clined relative to the longitudinal direction of the 
record carrier. An example of this is an RDAT-like 
recording method. Another method is to split the 

10 information and simultaneously recording tiie split 
information in a plurality of juxtaposed tracks which 
extend on the record carrier in the longitudinal 
direction of the record carrier. For this the use of 
an SDAT-like recording method may be consid- 

75 ered. A comprehensive description of the two 
above methods can be found in the aforementioned 
book "The art of a digital audio" by J. Watkinson. 
Again it is to be noted that the signal supplied by 
the unit 6 may be first be encoded in a signal 

20 converter. This encoding may again be an 8-to-10 
conversion followed by an interleaving process, as 
described with reference to Fig. 4. If the encoded 
information is recorded on the record carrier in a 
plurality of adjacent parallel frack, this signal con- 

25 verter should also be capable of assigning the 
encoded information to the various tracks. 

Rg. 14 shows diagrammatically an embodi- 
ment of the receiver 5, which in the present case 
takes the form of a read device for reading the 

30 record carrier 25 on which the wide-band digital 
signal has been recorded in the form of the second 
digital signal by means of the device shown in 
Fig.13. The second digital signal is read from a 
track on the record carrier by the read head 29 and 

35 is applied to the receiver 5, which may be for 
example of a construction as shown in Fig.12. 
Again the read device 28 may be constructed to 
carry out an RDAT-like or an SDAT like reproduc- 
ing method. Both methods are again described 

to comprehensively in the aforementioned book by 
Watkinson. If the signal supplied by the unit 6 In 
the recording device shown in Fig. 13 has been 
converted, for example in an 8-to-10 conversion 
and in an interleaving step, the encoded signal 

45 read from the record carrier 25 should first be de- 
interleaved and should be subjected to 10-tO-8 
conversion. Moreover, if the encoded signal has 
been recorded in a plurality of parallel tracks the 
reproducing unit shown in Fig. 14 should arrange 

so the information read from these tracks In the cor- 
rect sequence before further processing Is applied. 

Fig. 15 shows a number of other possibilities of 
inserting the scale factors and the samples in the 
third frame portion FD3 of a frame. Fig. 15a illus- 

55 trates the above described method in which the 
scale factors SF for all the subbands m and chan- 
nels (I or II) are inserted in the third frame portion 
before the samples. Fig. 15b illustrates the same 
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situation as Fig. 15a, but in this case it diagram- 
matlcally represents tlie storage capacity for the 
scale factors SF l,m and SF ll.m and the asso- 
ciated X samples for these two channels in the 
subband m. Fig. 15b shows the samples for the 
two channels in the subband m combined to 
blocks, whereas normally they are distributed with- 
in the third frame portion. The samples have a 
length of y bits. In the above example x is 12 and y 
is now taken to be 8. Fig.15c shows another for- 
mat. The two scale factors for the first and the 
second channel in the subband are still present in 
the third frame portion. However, instead of the x 
samples for both channels (the left and right chan- 
nels for a stereo signal) in the subband m (i.e. 2x 
samples in total) only x samples for the subband m 
are included in the third frame portion. These x 
samples are obtained, for example, by adding cor- 
responding samples in each of the two channels to 
one another. In fact, a monophonic signal Is ob- 
tained in this subband m. The x samples in Fig. 
15c each have a length of z bits. If z Is equal to y 
this saves room in the third frame portion, which 
can be used for samples requiring a more accurate 
quantisation. . It is alternatively possible to express 
the X samples of the mono signal in Z = 2y ( = 16) 
bits. Such a signal processing is applied If the 
phase difference between the left-hand and the 
right-hand signal component in a subband is irrele- 
vant but the waveform of the monophonic signal is 
important. This applies in particular to signals in 
higher subbands because the phase-sensitivity of 
the ear for the frequency in these subbands is 
smaller. By expressing the x samples of the mono 
signal in 16 bits the waveform is quantised more 
accurately, while the room occupied by these sam- 
ples in the third frame portion is equal to that in the 
example illustrated in Fig.lSb. Yet another possibil- 
ity is to represent the samples in Flg.15 by for 
example 12 bits. The signal definition Is then more 
accurate than In the example Illustrated In Fig. 15b 
whilst In addition room is saved in the third frame 
portion. When at the receiving end the signals 
included in the third frame portion as illustrated in 
Fig.15c are reproduced a stereo effect is obtained 
which is referred to as "intensity stereo". Here, 
only the intensities of the left-channel and the right- 
channel signals (in the subband m) can differ be- 
cause of a different value for the scale factors SF 
l,m and SF II, m. 

Fig. 15d gives still another possibility. In this 
case there is only one scale factor SFm for both 
signal components In the subband m. This is a 
situation which may occur in particular for low- 
frequency subbands. Yet another possibility, which 
is not shown, is that the x samples for the channels 
I and II of the subband m, as In Fig. 15b, do not 
have associated scale factors SF l,m and SF ll,m. 



Consequently, these scale factors are not inserted 
in the same third frame portion. In this case the 
scale factors SF l.m and SF ll.m included in the 
third frame portion of a preceding frame must be 

5 used for scaling up the samples in the receiver. 
All the possibilities described with reference to 
Rg.15 can be employed In the transmitter in order 
to achieve a most efficient data transfer via the 
transmission medium. Thus, frames as described 

70 with reference to Fig. 15 may occur alternately in 
the data stream. It will be appreciated that, if the 
receiver should yet be capable of correctly decod- 
ing these different frames, information about the 
structure of these frames should be included in the 

15 system information. 

Fig. 16 shows the transmitter in more detail. 
The Figure shows how the various Items of in- 
formation can be combined to form the serial data 
stream as given in Figs.1, 2 and 3. Fig. 16 in fact 

20 shows a more detailed version of the encoder 6 In 
the transmitter 1. The encoder comprises a central 
processing unit 30, which controls a number of 
devices in the encoder. The encoder comprises a 
generator 31 included in the processing unit 30 for 

ss generating the synchronising information and the 
system information, as described with reference to 
Rg.3, a generator 32 for defining the allocation 
Information, a generator 33 (optional) for determin- 
ing the scale factors, a generator 34 for defining 

30 the samples for a frame. The generator 35 is a 
generator which is capable of generating the addi- 
tional information packet IP P +1. The outputs of 
these generators are coupled to associated inputs 
of switching means 40 in the form of a five-position 

35 switch whose output is coupled to the output 7 of 
the encoder 6. The switching means 40 are also 
controlled by the processing unit 30. The various 
generators are controlled via the lines 41 .1 to 41 .4. 
The operation of the transmitter will be described 

40 for a mono signal divided Into M subband signals. 
These M subband signals Ssbi to Ssbm are applied 
to the terminals 45.1, 45.2, .... 45.M. For example, 
blocks of 12 samples of each of the subband 
signals are taken together. In the unit 46.1 to 46M, 

45 if present, the twelve samples in a block are scaled 
to the amplitude of the largest sample in the block. 
The M scale factors are applied to the unit 33 (if 
present) via the lines 47.1 to 47.M. The subband 
signals are applied both to the M quantisers 48.1 to 

50 48.M and to a unit 49. For every subband the unit 
49 defines the number of bits with which the rel- 
evant subband signal should be quantised. This 
information is applied to the respective quantisers 
48.1 to 48.M via the lines 50.1 to 50.M, so that 

55 these quantisers correctly quantise the 12 samples 
of each of the subband signals. Moreover this 
(allocation) information Is applied to the unit 32. 
The samples of the quantised subband signals are 
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applied to the unit 34 via the lines 51.1 to 51 .M. 
The units 32, 33 and "34 arrange the allocation 
information, the scale factors and the samples in 
the correct sequence Le. in the sequence as de- 
scribed hereinbefore. l\/loreover, the processing unit 
30 has generated the synchronising information 
and the system information associated with the 
frame to be generated, in which said information 
stored in the units 32, 33 and 34 should be in- 
serted. In the shown position of the switching 
means 40 the synchronising and system informa- 
tion for a frame is supplied by the generator 31 
and fed to the output 7. Subsequently, the switch 
40 is set to the second position from the top under 
the influence of the control signal supplied by the 
CPU 30, via the line 53 so that the output of the 
generator 32 is coupled to the output 7. Now the 
allocation information is applied to the output 7 by 
the generator 32. The sequence of the allocation 
information is as described with reference to Fig. 
1 0 or 1 1 . After this the switch 40 is set to the third 
position from the top. This means that the output of 
the generator 33 is coupled to the output 7. The 
generator 33 now supplies the scale factors in the 
correct sequence to the output 7. The switch 40 Is 
now set to the next position, so that the output of 
the generator 34 is coupled to the output 7. Now 
the generator 34 supplies the samples In the var- 
ious subbands in the correct sequence to the out- 
put 7. In this cycle exactly one frame is applied to 
the output 7. Subsequently, the switch 40 is reset 
to the top position. A new cycle is started, in which 
a subsequent block of 12 samples for each sub- 
band is encoded and a subsequent frame can be 
generated on the output 7. In some cases, for 
example if the sample frequency Fs is 44.1 kHz, 
see Fig. 5, an additional information pacl<et (the 
dummy slot, see Fig. 2) must be added. In that 
case the switch will be set from the position in 
which the generator 34 is coupled to the bottom 
position. The output of the generator 35 is now 
coupled to the output 7. Now the generator 35 
generates the additional Information packet IP 
P' + I, which is applied to the output 7. After this 
the switch 40 is reset to the top position to start the 
next cycle. It is obvious that, if the signal received 
by the transmitter is to be corrected for errors 
caused during transmission of the signal, a specific 
channel coding should be applied to the second 
digital signal. In addition, it is required to modulate 
the second digital signal prior to transmission of 
the second signal. Thus, a digital signal is transmit- 
ted via the transmission medium, which signal may 
not be directly identifiable as the second signal but 
which has been derived therefrom. Further, it Is to 
be noted that, for example in the case that the 
subbands have different widths, the number of 
samples for the various subbands inserted in one 



third frame portion may differ and are likely to 
differ. It is assumed, for example, that a division 
into three subbands is used, a lower subband SBi , 
a central subband SB2 and a upper subband SB3. 

5 The upper subband SBs will have a bandwidth 
which is, for example, twice as large as that of the 
other two subbands. This means that the number 
of samples inserted in the third frame portion for 
the subband SB3 is also twice as large as for each 

10 of the other subbands. The sequence in which the 
samples are applied to the reconstruction filter in 
the receiver may then be: the first sample of SBi , 
the first sample of SB3, the first sample of SB2, the 
second sample of SB3, the second sample of SBi, 

75 the third sample of SB3, the second sample of 
SB2, the fourth sample of SB3 etc. The se- 
quence in which the allocation information for these 
subbands is then inserted in the' second frame 
portion is now: first the allocation word for SBi, 

20 then the allocation word of SB3, subsequently the 
allocation word for SB2. The same applies to the 
scale factors. Moreover, the receiver can derive 
from the system information that in this case the 
cycle comprises groups of four samples each, 

25 each group comprising one sample of SBi, one 
sample of SB3, one sample of SB2 and subse- 
quently another sample of SB3. 

Figure 17 shows another structure of the first 
frame portion FD1. Again the first frame portion 

30 FD1 contains exactly 32 bits and therefore cor- 
responds to one information packet. The first 16 
bits again constitute the synchronising signal (or 
synchronisation word). The synchronisation word 
may again be the same as the synchronisation 

35 word of the first frame portion FD1 in Fig. 3. The 
information accommodated in bits 16 through 31 
differs from the information in bits 16 through 31 in 
Fig. 3. The bits bi6 through big represent the bit 
rate index (BR index). The bit rate index is a 4-bit 

40 number whose meaning is illustrated in the Table 
in Fig. 18. If the bit rate index is equal to the 4-blt 
digital number '0000' this denotes the free-format 
condition, which means that the bit rate is not 
specified and that the decoder has to depend upon 

45 the synchronisation word alone to detect the begin- 
ning of a new frame. The 4-bit digital number 
'1111'ls not employed in order not to disturb the 
synchronisation word detection. In the second col- 
umn of the Table in Fig. 18 the bit rate index is 

60 represented as a decimal number corresponding to 
the 4-bit digital number. The corresponding bit rate 
values are given in column 1. 

The bits 20 and 21 represent the sample fre- 
quency Fs, see Fig. 1 8. 

55 Rg. 18 shows the four possible 2-bit digital 

numbers for the bits bao and bai and the asso- 
ciated sample frequency. Bit 22 indicates whether 
the frame comprises a dummy slot, in which case 
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b22 = '1', or does not comprise a dummy slot, in 
which case b22 = '0'. The information in the bits 
bi6 through b22 makes it possible to detemiine 
how many information packets are actually present 
in the frame. This means again that the first frame 
portion contains information related to the number 
of information packets in the frame. As ng is known, 
which is the number of samples of the wide-band 
signal whose corresponding information belonging 
to the second digital signal is accommodated in 
one frame, in the present example ns = 384, it is 
possible to determine how many information pack- 
ets B are present in the frame by means of the 
data in the Table In Fig. 8, the padding bit b22 and 
the formula 

BR ng 

P = — X . 

N Fs 

The bit bza is intended for specifying a future 
extension of the system. This future extension. will 
be described hereinafter. For the time being this bit 
is assumed to be *0'. The content of the first frame 
portion, as regards the bits 24 b through bai, will 
be described with reference to Figs. 19 and 20. 
The bits b24 and bzs give the mode Indication for 
the audio signal. 

For the four possibilities of this two-bit digital 
number Fig. 20 shows whether the wide-band digi- 
tal signal is a stereo audio signal ('00'), a mono 
signal ('11'), a bilingual signal ('10'), or an Intensity 
stereo audio signal ('01'). In the last-mentioned 
case the bits 26 and 27 indicate which subbands 
have been processed In accordance with the inten- 
sity stereo method. Figure 20 Indicates for the 
respective two-bit numbers "OO', '01', '10', and 
•11'that the subbands 5-32, 9-32, 13-32 and 17-32 
have been processed in accordance with the inten- 
sity stereo method. As stated hereinbefore intensity 
stereo can be applied to the higher subbands be- 
cause the ear Is less phase-sensitive for the fre- 
quencies in these subbands. The bit b28 can be 
used as a copyright bit. If this bit is '1* this means 
that the information is copy-protected and 
should/cannot be copied. The bit bzs can indicate 
that the Information Is original information (b29 = 
'1'), for example In the case of the prerecorded 
tapes, or Infomiation which has been copied (b23 
= '0'). The bits bso and bai specify the emphasis 
which may have been applied to the wide-band 
signal in the transmitter, see also the description 
with reference to Fig. 7. 

Hereinafter, another configuration of the second 
frame portion FD2 will be described for the various 
mode indications represented by the bits bzi 



through b27 in the first frame portion. Again the 
second frame portion comprises the 4-bit allocation 
words whose meaning has been described with 
reference to Fig. 9. For the stereo mode (b24 , bss 
5 = 00) and the bilingual mode (bo^, b25 = 10) the 
second frame portion FD2 again has a length of 8 
information packets (slots) and is composed as 
described with reference to Fig. 10. In the stereo 
mode 'ir in Fig. 10 then represents, for example, 
70 the left-channel component and 'IP the right chan- 
nel component. For the bilingual mode T denotes 
one language and '11' denotes the other language. 
For the mono mode (b24, bss = 11) the length of 
the second frame portion FD2 is of course only 4 
75 information packets (slots). Fig. 21 Illustrates the 
sequence of the allocation words for the various 
subbands 1 through 32' in the four information 
packets (slots) 2 through 5. Thus, every quantity 
M-i represents a four-bit allocation word which 
20 specifies the number of bits in every sample in the 
subband of the sequence number i, i ranging from 
1 to 32. In the intensity stereo mode (bzi, bss = 
01) there are four possibilities indicated by means 
of the bits bas and b27, see Fig. 20. All these 
35 possibilities result in a different content of the sec- 
ond frame portion FD2. 

Figs. 22a to 22d Illustrate the four different 
contents of the second frame portion. If the switch 
bits b26, b27 are 'DO'the signals in the subbands 1 
30 through 4 are normal stereo signals and the signals 
in the subbands 5 through 32 are intensity-stereo 
signals. This means that for the subbands 1 
through 4 for the left-hand and right-hand channel 
components in these subbands the associated al- 
as location words should be stored in the second 
frame portion. In Fig. 22a this is represented by the 
consecutive allocation words AW (1, 1); AW (R, 1); 
AW (1, 2); AW (R, 2); ... AW (R, 4), stored in the 
slot 2 of the frame, i.e. the first slot of the second 
40 frame portion. Fig. 22a only gives ttie indices (i-j) 
of the allocation words, i being equal to L or R and 
Indicating the left-hand and the right-hand channel 
component respectively, and j ranging from 1 
through 4 and representing the sequence number 
45 of the subband. For the subbands 5 through 32 the 
left-hand and the right-hand channel components 
contain the same series of samples. The only dif- 
ference resides in the scale factors for the left-hand 
and the right-hand channel components in a sub- 
so band. Consequently, such a subband requires only 
one allocation word. The allocation words AW (i, j) 
for these subbands 5 through 32 are indicated by 
the indices IVI-j, where i is consequently equal to M 
for all the subbands and where j ranges from 5 
55 through 32. 

Fig. 22a shows that 4 1/2 information packets 
are required for inserting the 36 allocation words In 
the second frame portion. If the switch bits b2s, ba? 
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are '01 'the signals in the subbands 1 through 8 will 
be normal stereo signals and the signals in the 
subbands 9 through 32 will be intensity-stereo sig- 
nals. This means that for each of the subbands 1 
through 8 two allocation words AW(L, j) and AW- 
(R,j) are required and that for each of the subbands 
9 through 32 only one allocation word AW(M,j) Is 
required. This implies that in total 40 allocation 
words are needed, included in five Information 
packets (slots), i.e. IP2 through IP6, of the frame. 
This Is illustrated in Fig. 22b. In this case the 
second frame portion FD2 has a length of five 
information packets (slots). 

If the switch bits bac, b27 are '10'the signals in 
the subbands 1 through 12 will be normal stereo 
signals and the signals in the subbands 13 through 
32 will be intensity-stereo signals. Fig. 22c gives 
the structure of the second frame portion FD2 with 
the allocation words for the various subbands. The 
second frame portion now has a length of 5 1/2 
information packets (slots) in order to accommo- 
date all the allocation words. If the switch bits bze, 
b27 are '11 'the signals in the subbands 1 through 
16 will be normal stereo signals and the signals in 
the subbands 17 through 32 will be intensity-stereo 
signals. Now 48 allocation words are needed, which 
are inserted in the second frame portion, which 
then has a length of 6 Information packets (slots), 
see Fig. 22d. 

What has been stated above about the scale 
factors is also valid here. When it is assumed that 
an allocation word 0000 has been assigned neither 
to any of the subbands nor to any of the channels, 
64 scale factors are required both for the stereo 
mode and for the intensity-stereo modes. This is 
because in all the intensity-stereo modes every 
mono subband should have two scale factors to 
enable intensity-stereo to . be realised for the left- 
hand and the right-hand channel in this subband, 
see Fig. 15c. 

It is obvious that In the mono mode the number 
of scale factors is halved, i.e. 32, again assuming 
that the allocation word 0000 has not been as- 
signed to any of the subbands. 

A method of determining the 6-bit scale factors 
will now be explained below. As stated herein- 
before, the sample having the largest absolute val- 
ue is determined for every 12 samples of a sub- 
band channel. 

Fig. 24a shows this maximal sample ISmaj.]. The 
first bit, designated SQN, is the sign bit and is '0' 
because it relates to the absolute value of Smax- 
The samples are represented in two's complement 
notation. The sample comprises k 'zeros' followed 
by a "1 ". The values of the other bits of the 24-bit 
digital number are not relevant and can be either 
■O'or '1'. 

|Smax| is now multiplied by 2\ see Fig. 24b. 



Subsequently |Smax|-2'' is compared with a digital 
number DVi equal to 010100001100000000000000 
and a digital number DV2 equal to 
011001100000000000000000. If |Sma42'< < DVi a 

5 specific constant p is taken to be 2. If DVi i 
\S„Jl.2^ < DVa, then p is taken to be 1. If |S„ax|-2'' 
^ DV2, then p = p. The number k is limited to 0 ^ k 
i 20. The scale factor is now determined by the 
numbers k and p In accordance with the following 

70 formula. 

SF = 3k + p. 

Consequently, the maximum value for SF is 62. 
This means that the scale factors can be repre- 
sented by 6-bit numbers, the six-bit number 

75 111111 (which corresponds to the decimal number 
63) not being used. In fact, the 6-bit binary num- 
bers are not the scale factors but they are in a 
uniquely defined relationship with the actual scale 
factors, as will be set forth below. All the 12 sam- 

20 pies S are now multiplied by a number which is 
related to the values for k and p. The 12 samples 
are each multiplied as follows. 
S'= Sx2''xg(p) 

where the number g(p) has the following relation 
25 with p: 

g(p) = 1 for p = 0 

g(p) = 1 + 2-2 + 2-8 + 2-^° + 2-16 + 2-18 + 2-23 for p 
= 1 

g(p) 

30 1 + 2-'' + 2"* + 2-5 + 2"^ + 2-^ + 2-'° + 2-13 + 2-'=- 
+ 2--' 5 +2-1 7 +2-1 3 -^ 2-2° for p = 2. 

The parameter k specifies the number of 6 dB 
steps and the factors (g(1 ) and g(2) are the closest 
approximations to steps of 2 dB. The samples S 

35 thus scaled are now quantised to enable them to 
be represented by g-bit digital numbers in two's 
complement notation. In Fig. 25 this is illustrated 
for q = 3. The scaled samples s' have values 
between + 1 and -1 see Fig. 25a. In the quantiser 

40 these samples must be represented by q bits, q 
con-esponding to the allocation value for the rel- 
evant subband (channel). Since, as stated above, 
the q-bit digital number comprising only 'ones' is 
not used to represent a sample the total interval 

45 from -1 to +1 should be divided over 2'*'^ smaller 
intervals. For this purpose the scaled samples S 
are transformed into the samples S in accordance 
with the formula s"= s'(1-2-'>)-2-^. 

The samples S are subsequently truncated at 

so q bits, see Fig. 25c. Since the '111' representation 
is not permissible the sign bits are inverted, see 
Fig. 25d. The q( = 3)-bit numbers given in Fig. 25d 
are now inserted in the third frame portion FD3, 
see Fig. 2. 

55 Samples s' which comply with -0.71SS's-0.14 
are represented by the digital number '001 '. This 
proceeds similarly for samples S of larger values 
up to samples which comply with 0.71 S S'<1 and 
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which are represented by the digital number "110". 
Consequently, the digital number '111 'is not used. 

Dequantisation at the receiving side is effected 
in a manner inverse to the quantization at the 
transmission side, see Fig. 26. This means that first 
the sign bits of the q-bit digital numbers are in- 
verted to obtain the normal two's complement nota- 
tion, see Rg. 26b. 

Subsequently the samples s' are derived from 
the transformed samples S by means of the for- 
mula S' = (S" + 2-^*') (1 + 2-^ + 2:^1 + 2-^ + 
2-"^ + ...). see Figs. 26c and 26d. 
The values S' thus obtained are now situated ex- 
actly within the original intervals in Fig. 25a. At the 
receiving side the samples S are subsequently 
scaled to the original amplitudes by means of the 
transmitted information k, p which is related to the 
scale factors. Thus, at the receiving side a number 
g'{p) complies with: 
g'(p) = 1 for p = 0 

g'{p) = + 2-2 + 2-5 + 2-6 for p = 1 
g'(p) = 2-' + 2-3 + 2-8 + 2-3 for p = 2. 
Scaling to the original amplitudes is now effected 
using the following formula: 
S = s'.Z-i'.g^p). 

In the two possible versions of a frame as 
described with reference to Figures 2 and 3 and 
Figures 2, 17 and 19 respectively the third frame 
portion may not be filled entirely with information. 
This will occur more often and sooner as the al- 
gorithms for subband coding, i.e. the entire process 
of dividing the signal into subband signals and the 
subsequent quantization of the samples in the var- 
ious subbands, are improved. In particular, this will 
enable the information to be transmitted with a 
smaller number of bits (average number per sam- 
ple). The unused part of the third frame portion can 
then be utilized for transmitting additional informa- 
tion. In the first frame portion FD1 in Fig. 17 
allowance has been made for this by means of the 
"future-use" bit bzs. Normally, this bit Is '0', as will 
be apparent from Fig. 18. 

If an additional signal has been inserted in the 
third frame portion FD3 of a frame, the future-use 
bit bz3 in the first frame portion FD1, see Fig. 17, 
will be '1'. During reading of the first frame portion 
FD1 this makes it possible for the receiver to 
detect the frame contains additional information. 
The allocation information and the scale factors, 
see Fig. 23, inform the receiver that only the part of 
the third frame portion FD3. marked FD4 in Fig. 23, 
contains quantised samples of the subband signals. 
The remainder, marked FD5 in Fig. 23, now con- 
tains the additional information. The first bits in this 
frame portion FD5 are designated "EXT INFO' or 
extension information. These bits indicate the type 
of additional information. The additional infomiation 
may be, for example, an additional audio channel. 



for example for the transmission of a second stereo 
channel. Another possibility is to use these two 
" additional audio channels to realise 'surround 
sound' together with the audio subband signals in 

5 the frame portion FD4. In that case the front-rear 
infomiation required for surround sound may be 
included in the frame portion FD5. In the part, 
marked FD6 the frame portion FD5 may again 
contain allocation information, scale factors and 

70 samples (in this order) and the sequence of the 
allocation words and the scale factors may then be 
similar to the sequence as described with refer- 
ence to Figs. 2 and 3 and Figs. 2. 17 and 19. 

In the case of 'surround sound' simple receiv- 

75 ers may merely decode the stereo audio informa- 
tion in the frame portions FD2 and FD3, except for 
the frame portion FD5. More sophisticated receiv- 
ers are then capable of reproducing the surround- 
sound information and for this purpose they also 

20 employ the information in the frame portion FD5. 

The extension-info bits may also indicate that 
the information in the frame portion FD6 relates to 
text, for example in the form of ASCII characters. It 
may even be considered to insert video or picture 

25 information in the frame portion FD6, said informa- 
tion again being characterized by the extension-info 
bits. 

It is to be noted that the invention is not limited 
to the embodiments shown herein. The invention 
30 also relates to those embodiments which differ 
from the embodiments shown herein with respect 
to features which are not relevant to the Invention 
as defined in the Claims. 

35 

Claims 

1. A digital transmission system comprising a 
transmitter and a receiver, for transmitting a wide- 

40 band digital signal of a specific sample frequency 
Fs, for example a digital audio signal, via a trans- 
mission medium, and for receiving said signal, the 
transmitter having an input terminal for receiving 
the wide-band digital signal, which input terminal is 

45 coupled to an input of a signal source which forms 
part of the transmitter and which is constructed to 
generate a second digital signal and supply said 
signal to an output, which second digital signal 
comprises consecutive frames, each frame com- 

50 prising a plurality of information packets, each in- 
formation packet comprising N bits, N being larger 
than 1, the receiver comprising a decoder having 
an input for receiving the second digital signal, 
which decoder has an output coupled to an output 

55 terminal to supply the wide-band digital signal, 
characterized In that if P in the formula 



16 



31 



EP 0 402 973 A1 



32 



BR ng 

P = X 

N Fg 

is an integer, wiiere 

BR is tlie bit rate of the second digital signal, and 
ns is the number of samples of the wideband digital 
signal whose corresponding information, which be- 
longs to the second digital signal, is included in 
one frame of the second digital signal, the number 
of information packets B in one frame is P, and in 
that, if P is not an integer, the number of informa- 
tion packets in a number of the frames is P , P 
being the next lower integer following P, and the 
number of information packets in the other frames 
is equal to p' + 1 so as to exactly comply with the 
requirement that the average frame rate of the 
second digital signal should be substantially equal 
to Fs/ns and that a frame should comprise at least 
a first frame portion including synchronising in- 
formation. 

2. A transmission system as claimed in Claim 
1 , chat'acterized In that the first frame portion con- 
tains further information related to the number .of 
information packets in the frame. 

3. A transmission system as claimed in Claim 1 
or 2, characterized in that a frame comprises a first 
frame portion, a second frame portion and a third 
frame portion, the first frame portion further includ- 
ing system information and the second and the 
third frame portion including signal information. 

4. A transmission system as claimed in Claim 
1 , 2 or 3, characterized in that if a frame comprises 
p' + l information packets, the first frame portion 
contains infomnation corresponding to P . 

5. A transmission system as claimed in Claim 3 
or 4, the transmitter comprising a coder comprising 
signal-splitting means responsive to the wide-band 
digital signal to generate a second digital signal in 
the form of a number of M subsignals, M being 
larger than 1 , and comprising means for quantising 
the respective subsignals, characterized in that the 
second frame portion of a frame contains allocation 
information which, for at least a number of subsig- 
nals, indicates the number of bits representing the 
samples of the quantised subsignals derived from 
said subsignals, and in that the third frame portion 
contains the samples of at least said quantised 
subsignals (if present). 

6. A transmission system as claimed In Claim 
5, in which the signal-splitting means take the form 
of analysis-filter means responsive to the wide- 
band digital signal to generate a number of M 
subband signals, which analysis-filter means divide 
the signal band of the wide-band digital signal, 
applying sample-frequency reduction, into succes- 



sive subbands having band numbers m increasing 
with the frequency, and in which the quantisation 
means are adapted to quantise the respective sub- 
band signals block by block, characterized in that 

5 for at least a number of the subband signals the 
allocation information in the second frame portion 
of a frame specifies the number of bits represent- 
ing the samples of the quantised subband signals 
derived from said subband signals, and in that the 

70 third frame portion contains the samples of at least 
said quantised subband signals if present). 

7. A transmission system as claimed in Claim 
6, characterized in that in addition the third frame 
portion includes information related to scale factors, 

15 a scale factor being associated with at least one of 
the quantised subband signals contained in the 
third frame portion, and in that the scale factor 
information is included in the third frame portion 
before the quantised subband signals. 

20 8. A transmission system as claimed in Claim 6 
or 7, in which the receiver comprises a decoder 
comprising synthesis-filter means responsive to the 
respective quantised subband signals to construct 
a replica of the wide-band digital signal, which 

25 synthesis filter means combine the subbands, ap- 
plying sample frequency restoration to form the 
signal band of the wide-band digital signal, char- 
acterized in that the samples of the subband sig- 
nals (if present) are inserted in the third frame 

30 portion in a sequence corresponding to the se- 
quence in which said samples are applied to the 
synthesis-filter means upon reception in the re- 
ceiver. 

9. A transmission system as claimed in Claim 
35 8, characterized in that the allocation information 

for the various quantised subband signals Is in- 
serted in the second frame portion in a similar 
sequence. 

10. A transmission system as claimed in Claim 
40 9, characterized in that the scale information for the 

scale factors is inserted in the third frame portion in 
a sequence corresponding to the sequence in 
which the allocation information for the quantised 
subband signals associated with said scale factors 
45 is included in the second frame portion. 

11. A transmission system as claimed in any 
one of the Claims 6 to 10, the wide-band digital 
signal comprising a first and a second signal com- 
ponent, for example a digital stereo signal, the 

50 analysis-filter means being adapted to be respon- 
sive to the first and the second signal components 
to generate a number of M subband signals, each 
subband signal comprising a first and a second 
subband signal component, the means being fur- 

55 ther adapted to quantise the respective first and 
second subband signal components in a specific 
subband, characterized In that the second frame 
portion of a frame includes allocation information 
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34 



specifying for said subband the number of bits 
representing the samples of the quantised first and 
second subband signal components derived from 
each of the two subband signals of said subband, 
and in that the third frame portion includes samples 
of said quantised first and second subband signal 
components (If present). 

12. A transmission system as claimed in Claim 
11, when appendant to Claim 7, characterized in 
that the third frame portion includes scale-factor 
information for two scale factors for said subband, 
each scale factor belonging to the first or the 
second quantised subband signal component of 
said subband. 

13. A transmission system as claimed in Claim 
11 or 12, when appendant to Claim 8, in which the 
synthesis filter means are adapted to be respon- 
sive to the respective quantised subband signal 
components to construct a replica of the wideband 
digital signal comprising the first and the second 
signal component, characterized in that the sam- 
ples of the subband signal components (if present) 
are inserted in the third frame portion in a se- 
quence con-esponding to the sequence in which 
the samples of said subband signal components 
are applied to the synthesis filter means upon 
reception in the receiver. 

14. A transmission system as claimed in Claim 

13, characterized in that the allocation information 
for the various quantised subband signal compo- 
nents is inserted in the second frame portion In a 
similar sequence. 

15. A transmission system as claimed in Claim 

14, characterized in that the scale-factor informa- 
tion for the scale factors is inserted in the third 
frame portion in a sequence corresponding to the 
sequence In which the allocation information for the 
first and second quantised subband signal compo- 
nents belonging to said scale factors are inserted 
in the second frame portion, and in that the scale- 
factor information is inserted In the third frame 
portion before the quantised subband signal com- 
ponents. 

16. A transmission system as claimed in Claim 
5, characterized in that the (P' + 1)-st information 
packet does not contain any useful information. 

17. A transmission system as claimed in any 
one of the preceding Claims, characterized in that 
the frames comprise a fourth frame portion In 
which error-detection and/or error-correction infor- 
mation is included. 

18. A transmitter for use in a transmission 
system as claimed in any one of the preceding 
Claims. 

19. A transmitter as claimed in Claim 18, char- 
acterized In that the transmitter takes the form of a 
device for recording the second digital signal in a 
track on a record carrier. 



20. A record carrier obtained by means of the 
transmitter as claimed in Claim 19, characterized in 
that the second digital signal is recorded in the 
track. 

5 21. A receiver for use in a transmission system 

as claimed In any one of the Claims 1 to 16. 

22. A receiver as claimed in Claim 21, char- 
acterized in that the receiver takes the form of a 
device for reading the second digital signal from a 

10 track on a record carrier. 
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